In the Claims: 



1 . (Currently Amended) A method for presenting an application in a plurality of 
modalities, comprising the steps of: 

retrieving a modality-independent document from one of local and remote storage^ 
wherein the modality-independent document is an intent-based document that describes user 
interaction with the application separate from application content and presentation : 

parsing the modality-independent document using parsing rules obtained from one of 
local or remote storage; 

converting the modality-independent document to a first intermediate representation that 
can be rendered by a speech user interface modality; 

converting the modality-independent document to a second intermediate representation 
that can be rendered by a GUI (graphical user interface) modality; 

building a cross-reference table by which the speech user interface can access 
components comprising the second intermediate representation; 

rendering the first and second intermediate representations in their respective modality; 

and 

receiving a user input in one of the GUI and speech user interface modalities to enable 
multi-modal interaction and control the document presentation. 

2. (Original) The method of claim 1, wherein the GUI and speech user interface 
modalities are synchronized in the document presentation. 

3. (Original) The method of claim 1, wherein the first intermediate representation is 
stored in local system memory for immediate rendering. 

4. (Original) The method of claim 1, wherein the step of converting the 
modality-independent document to the first intermediate representation comprises transcoding 
the modality-independent document to a speech markup script. 
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5. (Original) The method of claim 4, wherein the step of rendering comprises the step of 
deferred rendering of the speech markup script. 

6. (Original) The method of claim 4, wherein the speech markup script is stored on a 
local persistent storage device. 

7. (Original) The method of claim 4, wherein the speech markup script comprises VXML 
(Voice extensible Markup Language). 

8. (Original) The method of claim 1, further comprising the step of executing an 
applications program when a corresponding event call occurs within the modality-independent 
document. 

9. (Original) The method of claim 8, wherein the step of executing an applications 
program comprises updating existing grammar rules with data values returned from the 
applications program. 

10. (Original) The method of claim 8, wherein the step of executing an appUcations 
program comprises updating content values associated with a component of the 
modality-independent document using data values returned from the applications program. 

11. (Original) The method of claim 1, fiirther comprising the step of registering a 
program to be executed upon completion of a specified event. 

12. (Canceled) 

13. (Original) A program storage device readable by machine, tangibly embodying a 
program of instructions executable by the machine to perform the method steps of claim 1 . 
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14. (Currently Amended) A method for providing global help information when 
presenting a modality-independent document, the method comprising the steps of: 

preparing an internal representation of a structure and component attributes of the 
modahty-independent document, wherein the modality-independent document is an intent-based 
document that describes user interaction with the application separate from application content 
and presentation : 

building a granmiar comprising rules for resolving specific spoken requests; 
processing a spoken request utilizing the grammar rules; and 

presenting an aural description of the modality-independent document in response to the 
spoken request, wherein presenting an aural description of the modality-independent document 
comprises providing global help information by presenting document components, attributes, and 
methods of interaction. 

15. (Cancelled) 

16. (Original) A program storage device readable by machine, tangibly embodying a 
program of instructions executable by the machine to perform the method steps of claim 14. 

17. (Currently Amended) A method for providing contextual help information when 
presenting a modality- independent document, the method comprising the steps of: 

preparing an internal representation of a structure and component attributes of the 
modality-independent document, wherein the modality-independent document is an intent-based 
document that describes user interaction with the application separate from application content 
and presentation : 

building a grammar comprising rules for resolving specific spoken requests; 
processing a spoken request utilizing the grammar rules; and 

presenting an aural description of the components, attributes, and methods of interaction 
of the modahty-independent document in response to the spoken request to provide contextual 
help information. 
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18. (Original) The method of claim 17, wherein the step of building a grammar 
comprises the step of combining values obtained from data stored in one of local storage, remote 
storage, and a combination thereof, with values obtained from an analysis of the 
modality-independent document. 

19. (Original) A program storage device readable by machine, tangibly embodying a 
program of instructions executable by the machine to perform the method steps of claim 17. 

20. (Currently Amended) A method for providing feedback information when presenting 
a modality-independent document, the method comprising the steps of: 

preparing an internal representation of the structure and component attributes of the 
modality-independent document, wherein the modality-independent document is an intent-based 
document that describes user interaction with the application separate from application content 
and presentation : 

building a grammar comprising rules for resolving specific spoken requests; 

processing a spoken request and resolving the spoken request utilizing the grammar rules; 

obtaining state and value information regarding specified components of the document 
from the internal representation of the document; and 

presenting an aural description of the content values associated with document 
components in response to the spoken request to provide feedback information. 

21 . (Original) The method of claim 20, wherein the step of building a grammar 
comprises the step of combining values obtained from data stored in one of local storage, remote 
storage, and a combination thereof, with values obtained from analysis of the document. 

22. (Original) A program storage device readable by machine, tangibly embodying a 
program of instructions executable by the machine to perform the method steps of claim 20. 
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23. (Currently Amended) A method for aurally spelling out content values associated 
with components of a modality-independent document, the method comprising the steps of: 

preparing an internal representation of a structure and component attributes of the 
modality-independent document, wherein the modality-independent document is an intent-based 
document that describes user interaction with the application separate from application content 
and presentation : 

building a grammar comprising rules for resolving specific spoken requests; 
processing a spoken request utilizing the grammar rules; 

obtaining state and content value information regarding specified components of the 
document from the internal representation of the document; and 

presenting each character of the content value information requested in response to the 
spoken request. 

24. (Original) The method of claim 23, wherein the step of presenting each character of 
the content value information comprises the step of inserting pauses between each character of 
the content value information to be presented. 

25. (Original) The method of claim 23, wherein the step of building a grammar 
comprises the step of combining values obtained from data stored in one of local storage, remote 
storage, and a combination thereof, with values obtained from an analysis of the document. 

26. (Original) A program storage device readable by machine, tangibly embodying a 
program of instructions executable by the machine to perform the method steps of claim 23. 

27. (Currently Amended) A system for presenting an application in a plurality of 
modalities, comprising: 

a multi-modal manager for parsing a modality-independent docmnent to generate a 
traversal model that maps components of the modality-independent document to at least a first 
and second modality-specific representation, wherein the modality-independent document is an 
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intent-based document that describes user interaction with the application separate from 
appHcation content and presentation : 

a speech user interface manager for rendering and presenting the first modahty-specific 
representation in a speech modahty; 

a GUI (graphical user interface) manager for rendering and presenting the second 
modality-specific representation in a GUI modality; 

an event queue monitor for detecting GUI events; 

an event queue for storing captured GUI events; and 

a plurality of methods, that are called by the speech user interface manager, for 
synchronizing I/O (input/output) events across the speech and GUI modalities. 

28. (Original) The system of claim 27, wherein the methods for synchronizing I/O 
events comprise a first method for polling for the occurrence of GUI events in the event queue 
and a second method for reflecting speech events back to the GUI manager and posting speech 
events to the multi-modal manager. 

29. (Original) The system of claim 27, fiirther comprising a method for invoking 
user-specified programs that are specified in the modality-independent document. 

30. (Original) The system of claim 27, wherein the multi -modal manager comprises a 
main renderer that instantiates the GUI manager, the speech user interface manager, and a 
method for capturing GUI events. 

31. (Original) The system of claim 27, wherein the speech user interface manager 
comprises JSAPI (java speech application program interface). 

32. (Original) The system of claim 27, wherein the speech user interface manager 
comprises a VoiceXML browser. 
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33. (Original) The system of claim 32, further comprising a transcoder for generating a 
VoiceXML script from the modality-independent document. 
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